Speakers Determination and Isolation from Multispeaker Speech Signal
نویسنده
چکیده
In this letter, we address the issue of determining the number of speakers from multispeaker speech signals collected simultaneously using a pair of spatially separated microphones. The spatial separation of the microphones results in time delay of arrival of speech signals from a given speaker. The differences in the time delays for different speakers are exploited to determine the number of speakers from the multi speaker signals. The key idea is that for a given speaker, the relative spacing’s of the instants of significant excitation of the vocal tract system remain unchanged in the direct components of the speech signals at the two microphones. The time delays can be estimated from the cross-correlation of the Hilbert envelopes of the linear prediction residuals of the multi speaker signals collected at the two microphones. Keywords— Excitation Source, Hilbert Envelope, Linear Prediction Residual, Multispeaker speech signal, time-delay estimation
منابع مشابه
Separation of Multispeaker Speech Using Excitation Information
In this paper, we propose an approach for separating speech of individual speakers from a multispeaker speech signal using excitation source information. The proposed approach is demonstrated in a two-microphone case. The main issue in the two-microphone case is the estimation of delay of each speaker. We propose a method for delay estimation in multispeaker case using the knowledge of excitati...
متن کاملEnhancement of speech in multispeaker environment
In this paper a method based on the excitation source information is proposed for enhancement of speech, degraded by speech from other speakers. Speech from multiple speakers is simultaneously collected over two spatially distributed microphones. Time-delay of each speaker with respect to the two microphones is estimated using the excitation source information. A weight function is derived for ...
متن کاملApplying Blind Signal Separation to the Recognition of Overlapped Speech
Blind signal separation method based on minimizing mutual information is applied to deal with multispeaker problem in speech recognition. Recognition experiments performed under di erent acoustic environments, in a soundproof room and a reverberant room, clarify that 1) the method can improve recognition accuracy by about 20% where SNR condition is 0 dB, 2) the method is more e ective when many...
متن کاملCrosscorrelation-based multispeaker speech activity detection
We propose an algorithm for segmenting multispeaker meeting audio, recorded with personal channel microphones, into speech and non-speech intervals for each microphone’s wearer. An algorithm of this type turns out to be necessary prior to subsequent audio processing because, in spite of close-talking microphones, the channels exhibit a high degree of crosstalk due to unbalanced calibration and ...
متن کاملModelling speaker intelligibility in noise
This study compared listeners’ performance on a multispeaker speech-in-noise task with that of a model inspired by automatic speech recognition techniques. Listeners identified three keywords in simple 6-word sentences presented in speech-shaped noise at a range of signal-to-noise ratios. Sentence material was provided by 18 male or 16 female speakers. An across-speaker analysis of a number of ...
متن کامل